This invention provides an amyloid precursor protein-cleaving protein and related nucleic acid
compounds. The invention also provides methods, materials and assays. The compounds of this
invention will further the characterization of neurological diseases such as Alzheimer's disease and

A peptide of 42 to 43 residues known as the β-amyloid peptide (β/A4) has been implicated in Alzheimer's
disease and Down's syndrome. Researchers hypothesize that abnormal accumulation of this 4 kilodalton (kd)
protein in the brain is due to cleavage of a larger precursor protein, called amyloid precursor protein (APP).
Normal cleavage of APP occurs within the β/A4 region, indicating that an alternate cleavage event occurs when
the normal full length is generated. The amino terminal residue of β/A4 is most often an aspartic acid (Asp),
indicating that a protease which cleaves between the methionine (Met) at position 596 [Met596 using the
numbering system according to J. Kang, et al., Nature 325:733 (1987)] and Asp597 of APP would generate amyloid.
Therefore, proteases which cleave the APP so as to generate β/A4 are important tools for characterizing
Alzheimer's disease and Down's syndrome.
In the past, researchers have attempted to characterize the abnormal cleavage event through the use of
classical protein purification techniques. These investigations have resulted in reports of a partially purified
68 kilodalton protease which cleaves at a Met-Asp bond of a synthetic peptide. C. Abraham, et al., Neurobiology
of Aging 11A:303 (1990). In 1991, Abraham and co-workers compared the cleavage pattern of the 68 kd
protease with known serine proteases. C. Abraham, et al., Biochemical and Biophysical Research
Communications, 174:790 (1991). Subsequently, the same researchers reported that the activity seen in the prior
studies was actually the action of two independent proteases. One was identified as a calcium-dependent serine
protease and the other a cysteine metalloprotease. C. Abraham, et al., Journal of Cellular Biochemistry, 15:115
(1991); C. Abraham, et al., Journal of Neurochemistry, 57:5109 (1991). No structure or characterization of
these proteases was disclosed.
The present invention provides a new enzyme which is structurally different from those previously
described and which will cleave APP to generate amyloidogenic fragments of the size expected of a Met596-Asp597
cleavage. Thus, the new enzyme is very useful in furthering the characterization of Alzheimer's disease and
Down's syndrome. Moreover, use of the invention may result in treatments for these or other related diseases.

To date there has been no satisfactory means of diagnosing Alzheimer's disease in a person until the
dementia completely manifests itself. Confirmation of the dementia as having arisen from Alzheimer's disease
requires a post-mortem examination of the brain of the afflicted patient. The instant invention provides a means
of determining those patients having Alzheimer's disease or a propensity of developing Alzheimer's disease
while such patients are still alive.

For purposes of clarity and as an aid in understanding the invention, as disclosed and claimed herein, the
following items are defined below.

"293 cells" refers to a widely available transformed human primary embryonal kidney cell line, as described
in F.L. Graham, et al., Journal of General Virology, 36:59-72 (1977). This cell line may be obtained, for example,
from the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland 20852-1776 (ATCC),
under the accession number ATCC CRL 1573.

"AV12 cells" refers to another widely available cell line which may be obtained from the ATCC under the
accession number ATCC CRL 9595.

"Amyloidogenic fragment" - An APP fragment comprising the β/A4 peptide.

"Functional compound of SEQ ID NO:1" - A compound comprising SEQ ID NO:1 which is capable of
cleaving APP.

"Kunitz-like domain" - A protease inhibitor similar to soybean trypsin inhibitor or a nucleic acid sequence
encoding a protease inhibitor which is similar to the soybean trypsin inhibitor. For example, the Kunitz Protease
Inhibitor (KPI) region of APP as described in P. Ponte, et al., Nature 331:525 (1988), or R.E. Tanzi, et al.,
Nature 331:528 (1988), or N. Kitaguchi, et al., Nature 331:530 (1988) is a Kunitz-like domain.

"pRc/Zyme" - A modified pRc/CMV eukaryotic expression vector, the pRc/CMV vector being available
commercially (Invitrogen Corporation, 3985 Sorrento Valley Blvd., Suite B, San Diego, California 92121). The
plasmid pRc/Zyme comprises a human cytomegalovirus promoter and enhancer, a bovine growth hormone
polyadenylation signal, a neomycin resistance gene, a beta-lactamase gene useful as an ampicillin resistance
marker in E. coli, and many other features as described in the 1991 Invitrogen Catalog, page 29, as well as a
NotI/SalI insert of 1451 base pairs which contains an entire Zyme coding region.

"pSZyme" - A modified E. coli cloning vector pSPORT-1™ [described in E.Y. Chen, et al., DNA, 4:165
(1985)], the plasmid pSPORT-1™ being commercially available (Gibco-BRL, 8400 Helgerman Court,
Gaithersburg, Maryland 20877). This plasmid contains an origin of replication from a pUC vector, this plasmid
being described in C. Yanisch-Perron, et al., Gene, 33:103-119 (1985); the beta-lactamase gene which confers
ampicillin resistance; a NotI/SalI insert of 1451 base pairs which contains an entire coding region of Zyme; as
well as other features.

"Part of SEQ ID NO:1" - At least 6 consecutive amino acid residues of SEQ ID NO:1.

"mRNA" - ribonucleic acid (RNA) which has been transcribed either in vivo or in vitro, including, for example,
RNA transcripts prepared in vitro by transcription of coding sequences of DNA by RNA polymerase.

"SEQ ID NO:1 or a functional equivalent thereof" - SEQ ID NO:1 or a conservative alteration of the amino
acid sequence of SEQ ID NO:1, wherein the conservative alteration results in a compound which exhibits
substantially the same biological, biochemical, chemical, physical and structural qualities of SEQ ID NO:1.

"SEQ ID NO:3" - The DNA sequence ATG GCT GGC GGC ATC ATA GTC AGG G.

"SEQ ID NO:4" - The DNA sequence AAC CGA ATC TTC AGG TCT TCC TGG GG.

"SEQ ID NO:5" - The DNA sequence TCG CTC TCT CCT GGG GAC ACA GA.

"SEQ ID NO:6" - The DNA sequence CCA GGT GCT ATT CCA TGT ATG TCA TAG.

"SEQ ID NO:7" - The DNA sequence TCT GTG TCC CCA GGA GAG AGC GA.

"SEQ ID NO:8" - The DNA sequence ATA GTC AAG CTC TCT TCT CAA T.

"Transfection" - Any transfer of nucleic acid into a host cell, with or without integration of said nucleic acid
into the genome of said host cell.

"Zyme" - The amino acid sequence SEQ ID NO:1 or a functional equivalent thereof.

"Zyme-related band configuration" - One of the two band configurations of a herein disclosed restriction
fragment length polymorphism. One pattern displays a 2400 base pair band, but no 2500 base pair band. The
other pattern displays a 2500 base pair band, but no 2400 base pair band.

The present invention provides amino acid compounds which comprise the amino acid sequence

Met Lys Lys Leu Met Val Val Leu Ser Leu Ile Ala Ala Ala Trp Ala
1               5                   10                  15

Glu Glu Gln Asn Lys Leu Val His Gly Gly Pro Cys Asp Lys Thr Ser
            20                  25                  30

His Pro Tyr Gln Ala Ala Leu Tyr Thr Ser Gly His Leu Leu Cys Gly
        35                  40                  45

Gly Val Leu Ile His Pro Leu Trp Val Leu Thr Ala Ala His Cys Lys
    50                  55                  60

Lys Pro Asn Leu Gln Val Phe Leu Gly Lys His Asn Leu Arg Gln Arg
65                  70                  75                  80

Glu Ser Ser Gln Glu Gln Ser Ser Val Val Arg Ala Val Ile His Pro
                85                  90                  95

Asp Tyr Asp Ala Ala Ser His Asp Gln Asp Ile Met Leu Leu Arg Leu
            100                 105                 110

Ala Arg Pro Ala Lys Leu Ser Glu Leu Ile Gln Pro Leu Pro Leu Glu
        115                 120                 125

Arg Asp Cys Ser Ala Asn Thr Thr Ser Cys His Ile Leu Gly Trp Gly
    130                 135                 140

Lys Thr Ala Asp Gly Asp Phe Pro Asp Thr Ile Gln Cys Ala Tyr Ile
145                 150                 155                 160

His Leu Val Ser Arg Glu Glu Cys Glu His Ala Tyr Pro Gly Gln Ile
                165                 170                 175

Thr Gln Asn Met Leu Cys Ala Gly Asp Glu Lys Tyr Gly Lys Asp Ser
            180                 185                 190

Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys Gly Asp His Leu Arg
        195                 200                 205

Gly Leu Val Ser Trp Gly Asn Ile Pro Cys Gly Ser Lys Glu Lys Pro
    210                 215                 220

Gly Val Tyr Thr Asn Val Cys Arg Tyr Thr Asn Trp Ile Gln Lys Thr
225                 230                 235                 240

Ile Gln Ala Lys
            244

hereinafter defined as SEQ ID NO:1, or a functional equivalent thereof. In particular, the amino acid compound
which is SEQ ID NO:1 is preferred.

Those in the art will recognize that some alterations of SEQ ID NO:1 will fail to change the function of the
amino acid compound. For instance, some hydrophobic amino acids may be exchanged for other hydrophobic
amino acids, amino acids with similar side chains may be interchanged, basic amino acids may be interchanged
with other basic amino acids, acidic amino acids may be interchanged with other acidic amino acids, small
amino acids may be interchanged with other small amino acids or various other conservative changes may
be made. Those altered amino acid compounds which confer substantially the same function in substantially
the same manner as the exemplified amino acid compound are also encompassed within the present invention.
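
The idea of a conservative alteration can be sketched in a few lines of Python. This is only an illustration: the residue classes below are an assumption chosen for the example (the specification names the categories but does not list their members), and the helper names `is_conservative` and `conservative_changes` are hypothetical.

```python
# Illustrative residue classes. The membership of each class is an assumption
# made for this sketch; the specification does not enumerate the residues.
RESIDUE_CLASSES = {
    "hydrophobic": {"Ala", "Val", "Leu", "Ile", "Met", "Phe", "Trp"},
    "basic": {"Lys", "Arg", "His"},
    "acidic": {"Asp", "Glu"},
    "small": {"Gly", "Ala", "Ser", "Thr"},
}


def is_conservative(original, replacement):
    """Return True when both residues share at least one illustrative class."""
    if original == replacement:
        return True
    return any(
        original in members and replacement in members
        for members in RESIDUE_CLASSES.values()
    )


def conservative_changes(reference, variant):
    """List (position, original, replacement, is_conservative) per difference.

    Positions are 1-based, matching the residue numbering used for SEQ ID NO:1.
    """
    if len(reference) != len(variant):
        raise ValueError("sequences must have the same length")
    return [
        (i + 1, a, b, is_conservative(a, b))
        for i, (a, b) in enumerate(zip(reference, variant))
        if a != b
    ]
```

Passing two equal-length lists of three-letter residue codes returns only the positions that differ, each flagged by whether the exchange stays within one of the assumed classes. Whether such a variant actually retains function would still have to be established experimentally.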

Artisans will also recognize that this protein can be synthesized by a number of different methods. All of
the amino acid compounds of the invention can be made by chemical methods well known in the art, including
solid phase peptide synthesis, or recombinant methods. Both methods are described in U.S. Patent 4,617,149.
Recombinant methods are preferred if a high yield is desired. A general method for the construction of any
desired DNA sequence is provided in Brown, et al., Methods in Enzymology, 68:109 (1979).

Other routes of production are well known. Expression in eucaryotic cells can be achieved via SEQ ID
NO:2, described infra. For example, the amino acid compounds can be produced in eucaryotic cells using
simian virus 40, cytomegalovirus, or mouse mammary tumor virus-derived expression vectors comprising DNA
which encodes SEQ ID NO:1. As is well known in the art, some viruses are also appropriate vectors. For
example, the adenovirus, the vaccinia virus, the herpes virus, the baculovirus, and the Rous sarcoma virus are
useful. Such a method is described in U.S. Patent 4,775,624. Several alternate methods of expression are
described in J. Sambrook, et al., Molecular Cloning: A Laboratory Manual, Chapters 16 and 17 (1989).

In another embodiment, the present invention encompasses nucleic acid compounds which comprise
nucleic acid sequences encoding SEQ ID NO:1. As skilled artisans recognize, the amino acid compounds of the
invention can be encoded by a multitude of different nucleic acid sequences due to the degeneracy of the
genetic code, wherein most of the amino acids are encoded by more than one nucleic acid triplet. Because these
alternate nucleic acid sequences would encode the same amino acid sequences, the present invention further
comprises these alternate nucleic acid sequences. Preferably, the nucleic acid compound is DNA, sense or
antisense mRNA. A most preferred embodiment of a DNA compound which encodes Zyme has this sequence:

ATGAAGAAGC TGATGGTGGT GCTGAGTCTG ATTGCTGCAG CCTGGGCAGA   50
GGAGCAGAAT AAGTTGGTGC ATGGCGGACC CTGCGACAAG ACATCTCACC  100
NNNNNNNNNN TGCCCTCTAC ACCTCGGGCC ACTTGCTCTG TGGTGGGGTC  150
CTTATCCATC CACTGTGGGT CCTCACAGCT GCCCACTGCA AAAAACCGAA  200
TCTTCAGGTC TTCCTGGGGA AGCATAACCT TCGGCAAAGG GAGAGTTCCC  250
AGGAGCAGAG TTCTGTTGTC CGGGCTGTGA TCCACCCTGA CTATGATGCC  300
GCCAGCCATG ACCAGGACAT CATGCTGTTG CGCCTGGCAC GCCCAGCCAA  350
ACTCTCTGAA CTCATCCAGC CCCTTCCCCT GGAGAGGGAC TGCTCAGCCA  400
ACACCACCAG CTGCCACATC CTGGGCTGGG GCAAGACAGC AGATGGTGAT  450
TTCCCTGACA CCATCCAGTG TGCATACATC CACCTGGTGT CCCGTGAGGA  500
GTGTGAGCAT GCCTACCCTG GCCAGATCAC CCAGAACATG TTGTGTGCTG  550
GGGATGAGAA GTACGGGAAG GATTCCTGCC AGGGTGATTC TGGGGGTCCG  600
CTGGTATGTG GAGACCACCT CCGAGGCCTT GTGTCATGGG GTAACATCCC  650
CTGTGGATCA AAGGAGAAGC CAGGAGTCTA CACCAACGTC TGCAGATACA  700
CGAACTGGAT CCAAAAAACC ATTCAGGCCA AG                     732

which is hereinafter defined as SEQ ID NO:2. However, also preferred are those nucleic acid compounds which
are sense and antisense mRNA.
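
The degeneracy argument can be checked with a short Python sketch. It translates the first 48 nucleotides of SEQ ID NO:2 with the standard genetic code and compares the result with a second DNA string that uses different codons. The alternate sequence is a made-up example for illustration and does not appear in the specification; the names `translate`, `SEQ2_START` and `ALTERNATE` are likewise hypothetical.

```python
from itertools import product

# Standard genetic code, built in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a DNA coding sequence into one-letter amino acid codes."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First 48 nucleotides of SEQ ID NO:2 (codons for residues 1 to 16).
SEQ2_START = "ATGAAGAAGCTGATGGTGGTGCTGAGTCTGATTGCTGCAGCCTGGGCA"

# Hypothetical alternate coding sequence using synonymous codons.
ALTERNATE = "".join([
    "ATG", "AAA", "AAA", "CTC", "ATG", "GTC", "GTC", "CTC",
    "TCC", "CTC", "ATC", "GCC", "GCC", "GCC", "TGG", "GCC",
])

print(translate(SEQ2_START))  # MKKLMVVLSLIAAAWA
print(translate(ALTERNATE) == translate(SEQ2_START))  # True
```

Both strings encode Met Lys Lys Leu Met Val Val Leu Ser Leu Ile Ala Ala Ala Trp Ala, the first 16 residues of SEQ ID NO:1, even though 12 of the 16 codons differ.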

Also provided by the present invention are nucleic acid vectors comprising nucleic acids which encode SEQ
ID NO:1 or a functional equivalent thereof. The preferred nucleic acid vectors are those which are DNA. Most
preferred are DNA vectors which comprise the DNA sequence which is SEQ ID NO:2. An especially preferred
DNA vector is the plasmid pSZyme.

E. coli/pSZyme, which contains a cloning vector comprising SEQ ID NO:2, was deposited and made part
of the stock culture collection of the Northern Regional Research Laboratories (NRRL), Agricultural Research
Service, U.S. Department of Agriculture, Peoria, Illinois, 61604 on April 29, 1992, under the accession number
NRRL B-18971. SEQ ID NO:2 can be isolated from the plasmid, for example, as a 1451 base pair NotI/SalI
restriction fragment. Other fragments are useful in obtaining SEQ ID NO:2.

Additionally, the DNA sequences can be synthesized using commercially available automated DNA
synthesizers, such as the ABS (Applied Biosystems, 850 Lincoln Centre Drive, Foster City, CA 94404) 380B DNA
synthesizer. The DNA sequences can also be generated by the polymerase chain reaction (PCR) as described
in U.S. Patent No. 4,889,818.

Restriction fragments of these vectors are also provided. The preferred fragments are the 1451 base pair
NotI/SalI restriction fragment, the 803 base pair BsrBI/Esp3I restriction fragment and the 815 base pair
EcoNI/BfaI restriction fragment of pSZyme.

Moreover, DNA vectors of the present invention preferably comprise a promoter positioned to drive
expression of SEQ ID NO:2, or a functional equivalent thereof. Those vectors wherein said promoter functions in
human embryonic kidney cells (293 cells), AV12 cells, yeast cells, or Escherichia coli cells are preferred. The DNA
expression vector most preferred is plasmid pRc/Zyme.

The plasmid pSZyme, isolatable from E. coli using standard techniques, is readily modified to construct
expression vectors that produce Zyme in a variety of organisms, including, for example, E. coli, yeast of the
family Saccharomycetes, and cells derived from fall armyworm ovaries of the genus Spodoptera (a
commonly used host for baculovirus expression systems). [Commonly used references, such as Sambrook et al.,
supra, describe these techniques.]

The current literature contains techniques for constructing AV12 expression vectors and for transfecting
AV12 host cells. See, e.g., U.S. Patent No. 4,992,373. The current literature also contains numerous techniques
for constructing 293 expression vectors and for transfecting 293 host cells.

The construction protocols utilized for 293 cells can be followed to construct analogous vectors for other
cell lines, merely by substituting, if necessary, the appropriate regulatory elements using well known
techniques. Promoters which may be used, for example, include the thymidine kinase promoter, the metallothionein
promoter, the heat shock promoter, immunoglobulin promoter, or various viral promoters such as the mouse
mammary tumor virus promoter, SV40 promoter, herpesvirus promoters, or the BK virus promoters. In addition,
artificially constructed promoters, derived from "consensus" sequences or created as hybrids of other promoters,
may be used in the course of practicing this invention.

The DNA compounds of the present invention also include primers and probes. Nucleic acid compounds
of at least 18 consecutive base pairs which encode SEQ ID NO:1 or a part thereof are included in the present
invention. Probes or primers which are DNA are preferred. Most preferred probes or primers are SEQ ID NO:3
and SEQ ID NO:4. Those in the art will recognize the techniques associated with probes and primers as being
well known.

For example, all or part of SEQ ID NO:3 or SEQ ID NO:4 may be used to hybridize to the coding sequence.
The full length sequence can then be generated using polymerase chain reaction (PCR) amplification, using
well known techniques. The full length sequence can be subsequently subcloned into any vector of choice.

Alternatively, SEQ ID NO:3 or SEQ ID NO:4 may be radioactively labeled at the 5' end in order to screen
cDNA libraries by conventional means. Furthermore, any piece of Zyme-encoding DNA which has been bound
to a filter may be saturated with total mRNA transcripts, in order to reverse transcribe the mRNA transcripts
which bind.
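
How a primer relates to its template can be illustrated with a small Python sketch. It locates SEQ ID NO:4 within nucleotides 191 to 220 of SEQ ID NO:2 and shows that SEQ ID NO:7 is the reverse complement of SEQ ID NO:5. The sequences are transcribed here in upper case; the helper names `reverse_complement` and `find_primer` are hypothetical, and exact string matching is an assumed simplification of real hybridization.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def find_primer(template, primer):
    """Return (1-based start, strand) of the first exact match, else None.

    The "+" strand means the primer reads as written in the template; the
    "-" strand means its reverse complement was found instead.
    """
    template = template.upper()
    position = template.find(primer.upper())
    if position >= 0:
        return position + 1, "+"
    position = template.find(reverse_complement(primer))
    if position >= 0:
        return position + 1, "-"
    return None


SEQ2_191_220 = "AAAAACCGAATCTTCAGGTCTTCCTGGGGA"  # nucleotides 191-220 of SEQ ID NO:2
SEQ_ID_4 = "AACCGAATCTTCAGGTCTTCCTGGGG"
SEQ_ID_5 = "TCGCTCTCTCCTGGGGACACAGA"
SEQ_ID_7 = "TCTGTGTCCCCAGGAGAGAGCGA"

print(find_primer(SEQ2_191_220, SEQ_ID_4))      # (4, '+')
print(reverse_complement(SEQ_ID_5) == SEQ_ID_7)  # True
```

The match at offset 4 of the excerpt corresponds to nucleotide 194 of SEQ ID NO:2, so SEQ ID NO:4 reads along the coding strand.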

Primers and probes may be obtained by means well known in the art. For example, once pSZyme is
isolated, restriction enzymes and subsequent gel separation may be used to isolate the fragment of choice.

Another embodiment of the present invention is a genomic clone of Zyme. The preferred genomic clone
is the 4.0 kilobase HindIII fragment from a human chromosome 19 library which hybridizes to fragments of DNA
which encode SEQ ID NO:1. This can be obtained via hybridization with SEQ ID NO:2, or parts thereof. For
example, SEQ ID NO:3 and SEQ ID NO:4 may be radioactively labelled and used to probe a chromosome 19
library, in order to then identify and isolate the corresponding genomic DNA.

The present invention also provides an Alzheimer's diagnostic assay wherein donor human DNA is:
1) digested with the restriction enzyme TaqI;
2) hybridized with labelled Zyme DNA to reveal a Zyme-related band configuration; and
3) compared to the similarly-digested and hybridized band configurations of those members of the donor's
family who display or displayed the symptoms of Alzheimer's disease.
The preferred Alzheimer's diagnostic assay utilizes a blood sample as the source of donor human DNA.

Since the genomic DNA is provided in the present invention and a Zyme-related restriction fragment length
polymorphism is identified by the disclosure of this invention, the remainder of this procedure may be
accomplished according to methods known in the art. For example, U.S. Patent 4,666,828 describes these
procedures. [Numerous references, such as B. Lewin, Genes, at page 78 (1987), review restriction fragment length
polymorphism techniques and theory.]
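
The comparison step of the diagnostic assay can be sketched as follows. The sketch assumes that band sizes have already been read from the blot and rounded to nominal values in base pairs, which is a simplification; the function names `band_configuration` and `matches_affected_relatives` are hypothetical, and the output is a bookkeeping aid rather than a diagnosis.

```python
def band_configuration(band_sizes_bp):
    """Classify a lane by the two Zyme-related band configurations.

    Returns "2400" for a 2400 bp band without a 2500 bp band, "2500" for the
    converse, and None when neither configuration applies.
    """
    bands = set(band_sizes_bp)
    has_2400 = 2400 in bands
    has_2500 = 2500 in bands
    if has_2400 and not has_2500:
        return "2400"
    if has_2500 and not has_2400:
        return "2500"
    return None


def matches_affected_relatives(donor_bands, affected_relatives_bands):
    """Return True when the donor shares one configuration with every
    affected relative supplied; False otherwise (including no relatives)."""
    donor = band_configuration(donor_bands)
    relatives = {band_configuration(bands) for bands in affected_relatives_bands}
    return donor is not None and relatives == {donor}
```

Each lane is reduced to one of the two configurations before any comparison, so unrelated bands on the blot do not affect the result.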

Host cells which harbor the nucleic acids provided by the present invention are also encompassed within
this invention. A preferred host cell is an oocyte. A preferred oocyte is one which has been injected with sense
mRNA or DNA compounds of the present invention. A still more preferred oocyte is one which has been injected
with sense mRNA or DNA compounds of the present invention in conjunction with DNA or mRNA which encodes
APP. Most preferred oocytes of the present invention are those which have been injected with sense mRNA.

Other preferred host cells are those which have been transfected with a vector which comprises SEQ ID
NO:2. Preferred SEQ ID NO:2-transfected host cells include 293, AV12, yeast and E. coli cells. Most
preferred 293 and E. coli host cells are 293/pRc/Zyme and E. coli/pSZyme.

Also preferred is a host cell which has been co-transfected with a DNA vector which comprises SEQ ID
NO:2 and a DNA vector which comprises the coding sequence of APP. 293 cells, AV12 cells, yeast cells and
E. coli cells are especially useful co-transfected host cells.

An oocyte host cell can be constructed according to the procedure described in Lubbert, et al., Proceedings
of the National Academy of Sciences (USA), 84:4332 (1987). DNA or RNA which encodes APP (both the 695
and 751 amino acid forms) may be obtained as described in Selkoe et al., Proceedings of the National Academy
of Sciences (USA), 85:7341 (1988). Other host cell transfection is well known in the art. Co-transfection of cells
[...] techniques. See, e.g., Gorman et al., Molecular and Cellular Biology,
2:1044 (1982).

Therefore, the present invention also provides a process for constructing a host cell capable of expressing
SEQ ID NO:1, said method comprising transfecting a host cell with a DNA vector that comprises a DNA
sequence which encodes SEQ ID NO:1. A preferred method utilizes 293 cells as host cells. These 293 cells may
be obtained from the ATCC under the accession number ATCC CRL 1573. Another preferred method utilizes
AV12 cells as host cells. AV12 cells may be obtained from the ATCC under the accession number ATCC CRL
9595. Another preferred method utilizes yeast cells of the family Saccharomycetes or the bacterium E. coli as
the host cells.

The preferred process utilizes an expression vector which comprises SEQ ID NO:2 in 293 cells. Especially
preferred for this purpose is pRc/Zyme.

Another preferred process comprises (a) a DNA vector which comprises SEQ ID NO:2 and (b) a DNA
expression vector which encodes the APP coding sequence. A most preferred process utilizes the DNA vector
pRc/Zyme. Transfected host cells may be cultured under conditions well known to skilled artisans such that
SEQ ID NO:1 is expressed, thus producing Zyme in the transfected host cell.

Additionally, the invention provides a process for identifying DNA homologous to a probe of the present
invention, which comprises combining test nucleic acid with the probe under hybridizing conditions and
identifying those test nucleic acids which hybridize. The preferred probes for use in this method are SEQ ID NO:3
and SEQ ID NO:4. Hybridization techniques are well known in the art. See, e.g., Sambrook, et al., supra.

Assays utilizing the compounds provided by the present invention are also encompassed within this
invention. The assays provided determine whether a substance is a ligand for Zyme, said method comprising
contacting Zyme with said substance, monitoring Zyme activity by physically detectable means, and identifying
those substances which interact with or affect Zyme.

Preferred assays of the present invention incorporate a cell culture assay, a high performance liquid
chromatography (HPLC) assay or a synthetic competition assay.

Preferred cell culture assays utilize oocytes, AV12, E. coli, yeast or 293 cells which co-express nucleic
acids which encode Zyme and APP. Those co-expressing cell culture assays which are preferred include those
which utilize 293/pRc/Zyme. A preferred assay utilizes yeast cells, and a DNA compound which encodes amino
acids 587 to 606 of APP. One method of performing the yeast assay is described in Smith and Kohorn,
Proceedings of the National Academy of Sciences, USA, 88:5159 (1991), using Zyme-encoding DNA and
APP-encoding DNA which comprises the Met596/Asp597 cleavage site codons.

Most preferred oocyte assays co-express mRNA. Most preferred cell culture assays utilize Western blot
analysis or radiolabelled APP as the physically detectable means. A preferred HPLC assay is one wherein the
substrate utilized is a full length, eukaryotically-derived APP.

The most preferred synthetic competition assay is one wherein the substance competes with the Kunitz- 
like domain gene product for binding to Zyme. The most preferred Zyme/Kunitz domain competition assay is 
one wherein APP is labelled with radioisotope. 

The cell culture assays may be accomplished according to the procedures detailed by F. Ausubel, et al.,
Current Protocols in Molecular Biology (1989) at pages 9.1-9.5. The HPLC assay may be performed
essentially as described in Hirs and Timasheff, eds., Methods in Enzymology, Volume 91, Sections V and VI (1983).
The Zyme/Kunitz-like domain binding or competition assay may be performed as described by J. Bennet and
H. Yamamura, Neurotransmitter Receptor Binding (1985), Chapter 3.

The present invention also provides a method for identifying or purifying Zyme, which comprises saturating
test protein with anti-Zyme antibody, eliminating anti-Zyme antibody which fails to bind, and detecting the
anti-Zyme antibody which remains bound. Antibody imaging techniques are known in the art.

The following are examples of aspects of the present invention. These examples are illustrative only and
are not intended to limit the scope of the invention in any way.

Example 1

Production of Zyme in 293 cells 

A lyophilized aliquot of E. coli/pSZyme can be obtained from the Northern Regional Research Laboratories,
Peoria, Illinois, USA 61604, under the accession number NRRL B-18971 and used directly as the culture in
the process described below. This culture has been deposited with the NRRL.

Plasmid pSZyme was isolated from a culture of E. coli/pSZyme by cesium chloride purification. Plasmid
pSZyme was then digested with SalI and NotI. The resulting fragment was linear. DNA ligase was used to ligate
this SalI-NotI fragment and a SalI-HindIII linker into a previously linearized pRc plasmid (Invitrogen, catalog
#V750-20).

Competent E. coli cells were then transfected with the newly created pRc/Zyme vector which contained
SEQ ID NO:2 and selected for those cells which contained the ampicillin resistance gene by growing on
ampicillin-containing medium.

After transfection of the pRc/Zyme vector into E. coli, a subsequent plasmid preparation was made in order
to isolate the pRc/Zyme vector. In order to transfect 293 cells with the pRc/Zyme vector, the procedure
developed by Chen and Okayama was employed. C. Chen and H. Okayama, Molecular and Cellular Biology, 7:2745
(1987). These cells were used in the cell culture assay as described in Example 2.

Selection on the antibiotic G418 (geneticin) was included in this step to produce stable transformants in
293 cells. The colonies which grew in the presence of G418 were then used as a source of Zyme.

Example 2 

Cell Culture Assay 

Human embryonic kidney cells (293 cells) were co-transfected with pRc/Zyme and an APP-encoding vector.
On one occasion, a vector encoding the 695 amino acid APP (which lacks a Kunitz-like domain) was
cotransfected with pRc/Zyme. On another occasion, a vector encoding the 751 amino acid APP (with the Kunitz-like
domain) was cotransfected with pRc/Zyme.

Transfection was achieved using standard calcium phosphate transfection. Other transfection protocols,
such as described by Sambrook, et al., supra, are also effective. Amyloidogenic fragments were detected when
the 695 amino acid (without KPI) APP coding sequence was used, via Western blot analysis, as described in
Sambrook, et al., supra, using antisera to the carboxy-terminal amino acids of the APP protein. Anti BX6, as
described in T. Oltersdorf, et al., Journal of Biological Chemistry, 265:4492-4497 (1991), was used in this
procedure. Amyloidogenic fragments were not detected when the 751 amino acid (with KPI) APP was used.

Example 3 

HPLC Assay 

Full length APP is produced in cells which have been infected with APP-encoding baculovirus. This
procedure is accomplished according to J. Knops, et al., Journal of Biological Chemistry, 266:7285 (1991). APP
is then incubated in the presence of active Zyme and test compound. APP fragments are subsequently
separated by high performance liquid chromatography. Each pooled fragment is then microsequenced using
standard techniques, such as those of Hirs and Timasheff, eds., Methods in Enzymology, Vol. 91, Sections V and
VI (1983). The quantity of amyloidogenic fragments (those which terminate at either Met596 or Asp597) generated
is compared to the quantity generated in the absence of test compound to determine the ability of the test
compound to affect Zyme.
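
One simple way to express that comparison is as a percent change relative to the no-compound control. The sketch below assumes the fragment quantities are already available as numbers in the same units (for example, integrated HPLC peak areas); that choice of readout and the function name `percent_change` are assumptions for illustration.

```python
def percent_change(amount_with_compound, amount_without_compound):
    """Percent change in amyloidogenic fragment quantity versus the control.

    Negative values mean fewer fragments were generated in the presence of
    the test compound; positive values mean more.
    """
    if amount_without_compound <= 0:
        raise ValueError("control quantity must be positive")
    difference = amount_with_compound - amount_without_compound
    return 100.0 * difference / amount_without_compound
```

A value near zero suggests the compound has little effect on Zyme under the conditions tested, while a strongly negative value points to inhibition.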

Example 4 

Zyme/Kunitz-like Domain Competition Assay 

A peptide representing the KPI domain of APP is synthesized and labelled with the isotope iodine-125
(125I). Competition binding assays are then conducted according to J.P. Bennet and H. Yamamura,
Neurotransmitter Receptor Binding 61 (1985). Zyme is then bound to plastic microtitre wells as in the traditional ELISA
assay. One such typical protocol for this step is described in F. Ausubel, et al., Current Protocols in Molecular
Biology, 2:11.1-11.3 (1989). Radiolabelled KPI domain and unlabelled competitor compound are subsequently
added to the wells of the 96-well microtitre plate. The wells are then washed. The remaining isotope is recorded in
order to calculate the relative affinity of the unlabelled competitor compound to Zyme.
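
The example does not state how relative affinity is calculated from the recorded counts. A common approach, assumed here purely for illustration, is to express each well as a fraction of the label bound without competitor and then estimate the competitor concentration that displaces half of it (an IC50). The function names and the log-linear interpolation are choices made for this sketch.

```python
import math


def fraction_bound(counts, total_binding, nonspecific=0.0):
    """Fraction of specific binding remaining in a well.

    counts        -- isotope counts in the well with competitor
    total_binding -- counts with no competitor present
    nonspecific   -- background counts (assumed 0 if not measured)
    """
    return (counts - nonspecific) / (total_binding - nonspecific)


def estimate_ic50(concentrations, fractions):
    """Interpolate, on a log scale, the concentration giving 50% binding.

    Returns None when the data never cross the 50% level.
    """
    points = sorted(zip(concentrations, fractions))
    for (c_lo, f_lo), (c_hi, f_hi) in zip(points, points[1:]):
        if f_lo >= 0.5 >= f_hi and f_lo != f_hi:
            t = (f_lo - 0.5) / (f_lo - f_hi)
            log_ic50 = math.log10(c_lo) + t * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None
```

Competitors can then be ranked by their estimated IC50 values, with a lower value indicating that less compound is needed to displace the labelled KPI domain from Zyme.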

Example 5 

Isolating the Genomic Clone 

A genomic library specific for human chromosome 19 in Charon 21A bacteriophage was
purchased from the American Type Culture Collection, 12301 Parklawn Drive, Rockville, Maryland, USA 20852
(ATCC) (Catalog number 57711). These phage were transfected into E. coli K802 recA- host strain (Cat. no.
47026). The titre of the phage was 6^.0 X 10* plaque forming units per microliter. A genomic clone of the
gene encoding Zyme was isolated by conventional screening of phage libraries (See, e.g., Sambrook et al.,
Molecular Cloning: A Laboratory Manual 2.6-2.114, 1989).

EP 0 576 152 A1

A radlolabelled cDNA probe was synthesized utilizing the polymerase chain reaction (such as that descri- 
bed by Schowalter and Sommer, Analytical Biochemistry , 177:90-94, 1989) by specifically annealing SEQ ID 
NO: 5 and SEQ ID NO: 6 primers to an EcoRiyNotl purified (Bio-Rad Laboratories, P.O. Box 708, Rockville Cen- 
tre, New York USA, 11571. catalog number 732-6010) pRc-Zyme DNA fragment 

Hybridization and washing was carried out at eS^C as described in the Zeta-Probe™ blotting membrane 
instruction manual (Bio-Rad, catalog number 164-0153). Putative primary Zyme bacteriophage were stored in 
SM buffer containing 2-3 drops of chloroform. A single homogenous plaque (711-4) was subsequently isolated 
from a tertiary screen. Isolation of lambda bacteriophage DNA positive by in situ hybridization to Zyme was 
accomplished using standard techniques. 

Purified lambda phage Zyme DNA was digested with Hindlll and electrophoresed on a 1% agarose/TBE 
(0.1 M Tris-HQ pH 8.3, 0.1 M boric acid, 1 mM ethylenediaminetetraacetic acid) gel. Separated DNA was then 
transferred onto a Zeta-Probe™ blotting membrane (0.5x TBE running buffer, constant 80 volts for 1 hour) as 
described in section 2.5 of the Zeta-Probe*™ instruction manual using non-denaturing conditions, then dena- 
tured (0.4M NaOH for 10 minutes) as described in section 2.8 of the Zeta-Probe™ instruction manual. 

A radiolabelled probe encompassing the BamHI/XbaI fragment of pRc/Zyme was prepared with a random 
primed DNA labelling kit (such as that which is commercially available from Boehringer Mannheim Corporation, 
9115 Hague Road, P.O. Box 50414, Indianapolis, Indiana, USA 46250-0414, catalog number 1004760) to 
determine whether the 3' coding sequence was present in our clone. Hybridization to and washing of the above 
Zeta-Probe™ membrane were performed as previously described, and autoradiography revealed homology to the 
3' region of Zyme. 

To confirm that phage 711-4 contained the 5' Zyme coding region, the polymerase chain reaction using 
SEQ ID NO:7 and SEQ ID NO:8 was again utilized to specifically amplify a 470 base pair band from tertiary 
plaque purified chromosome 19 Zyme phage DNA according to Kainz, et al., Analytical Biochemistry, 202:46 
(1992). This DNA fragment was purified, then subcloned into the pUC19 expression plasmid, described supra. 
The identity of the DNA sequences corresponding to sequences 1 to 33 of the 5' Zyme cDNA coding region 
and an additional 272 nucleotides upstream of the 5' Zyme coding region was confirmed by DNA sequence 
analysis, using standard techniques. 
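
The relationship between two of the primers cited in this example can be checked directly from the Sequence Listing. The following is a minimal Python sketch and is not part of the original disclosure; the function name reverse_complement and the variable names are illustrative assumptions. It compares SEQ ID NO:5 and SEQ ID NO:7 as printed.

```python
# Illustrative helper (not from the source): reverse complement of a DNA string.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of an upper-case DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Primer sequences copied from the Sequence Listing, spaces removed.
SEQ_ID_NO_5 = "TCG CTC TCT CCT GGG GAC ACA GA".replace(" ", "")
SEQ_ID_NO_7 = "TCT GTG TCC CCA GGA GAG AGC GA".replace(" ", "")

# Compare the reverse complement of SEQ ID NO:5 with SEQ ID NO:7.
print(reverse_complement(SEQ_ID_NO_5) == SEQ_ID_NO_7)
```

The comparison prints True: as printed, SEQ ID NO:7 is the reverse complement of SEQ ID NO:5, so the two primers anneal to opposite strands of the same 23-nucleotide site.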

Plasmid Deposits 

Under the provisions of the Budapest Treaty on the International Recognition of the Deposit of Microor- 
ganisms for Purposes of Patent Procedures the following culture has been deposited with the permanent cul- 
ture collection of the Northern Regional Research Center (NRRL), Agricultural Research Service, U.S. De- 
partment of Agriculture, 1815 N. University Street, Peoria, Illinois, 61604: 



Deposited Material          Accession Number 

E. coli K12/pSZyme          NRRL B-18971 

SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: ELI LILLY AND COMPANY 

(ii) TITLE OF INVENTION: PROTEASE AND RELATED DNA COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 8 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: K. G. TAPPING 
(B) STREET: ERL WOOD MANOR 
(C) CITY: WINDLESHAM 
(D) STATE: SURREY 
(E) COUNTRY: UNITED KINGDOM 
(F) ZIP: GU20 6PH 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Diskette 
(B) COMPUTER: Macintosh 
(C) OPERATING SYSTEM: Macintosh 7.0 
(D) SOFTWARE: Microsoft Word 



SEQ ID NO: 1 

Met Lys Lys Leu Met Val Val Leu Ser Leu Ile Ala Ala Ala Trp Ala 
                5                   10                  15

Glu Glu Gln Asn Lys Leu Val His Gly Gly Pro Cys Asp Lys Thr Ser 
            20                  25                  30

His Pro Tyr Gln Ala Ala Leu Tyr Thr Ser Gly His Leu Leu Cys Gly 
        35                  40                  45

Gly Val Leu Ile His Pro Leu Trp Val Leu Thr Ala Ala His Cys Lys 
    50                  55                  60

Lys Pro Asn Leu Gln Val Phe Leu Gly Lys His Asn Leu Arg Gln Arg 
65                  70                  75                  80

Glu Ser Ser Gln Glu Gln Ser Ser Val Val Arg Ala Val Ile His Pro 
                85                  90                  95

Asp Tyr Asp Ala Ala Ser His Asp Gln Asp Ile Met Leu Leu Arg Leu 
            100                 105                 110

Ala Arg Pro Ala Lys Leu Ser Glu Leu Ile Gln Pro Leu Pro Leu Glu 
        115                 120                 125

Arg Asp Cys Ser Ala Asn Thr Thr Ser Cys His Ile Leu Gly Trp Gly 
    130                 135                 140

Lys Thr Ala Asp Gly Asp Phe Pro Asp Thr Ile Gln Cys Ala Tyr Ile 
145                 150                 155                 160

His Leu Val Ser Arg Glu Glu Cys Glu His Ala Tyr Pro Gly Gln Ile 
                165                 170                 175

Thr Gln Asn Met Leu Cys Ala Gly Asp Glu Lys Tyr Gly Lys Asp Ser 
            180                 185                 190

Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys Gly Asp His Leu Arg 
        195                 200                 205

Gly Leu Val Ser Trp Gly Asn Ile Pro Cys Gly Ser Lys Glu Lys Pro 
    210                 215                 220

Gly Val Tyr Thr Asn Val Cys Arg Tyr Thr Asn Trp Ile Gln Lys Thr 
225                 230                 235                 240

Ile Gln Ala Lys 
            244

SEQ ID NO: 2 

ATGA AGAAGCTGAT 14 

GGTGGTGCTG AGTCTGATTG CTGCAGCCTG GGCAGAGGAG CAGAATAAGT TGGTGCATGG 74 

CGGACCCTGC GACAAGACAT CTCACCCCTA CCAAGCTGCC CTCTACACCT CGGGCCACTT 134 

GCTCTGTGGT GGGGTCCTTA TCCATCCACT GTGGGTCCTC ACAGCTGCCC ACTGCAAAAA 194 

ACCGAATCTT CAGGTCTTCC TGGGGAAGCA TAACCTTCGG CAAAGGGAGA GTTCCCAGGA 254 

GCAGAGTTCT GTTGTCCGGG CTGTGATCCA CCCTGACTAT GATGCCGCCA GCCATGACCA 314 

GGACATCATG CTGTTGCGCC TGGCACGCCC AGCCAAACTC TCTGAACTCA TCCAGCCCCT 374 

TCCCCTGGAG AGGGACTGCT CAGCCAACAC CACCAGCTGC CACATCCTGG GCTGGGGCAA 434 

GACAGCAGAT GGTGATTTCC CTGACACCAT CCAGTGTGCA TACATCCACC TGGTGTCCCG 494 

TGAGGAGTGT GAGCATGCCT ACCCTGGCCA GATCACCCAG AACATGTTGT GTGCTGGGGA 554 

TGAGAAGTAC GGGAAGGATT CCTGCCAGGG TGATTCTGGG GGTCCGCTGG TATGTGGAGA 614 

CCACCTCCGA GGCCTTGTGT CATGGGGTAA CATCCCCTGT GGATCAAAGG AGAAGCCAGG 674 

AGTCTACACC AACGTCTGCA GATACACGAA CTGGATCCAA AAAACCATTC AGGCCAAG 732 

SEQ ID NO: 3 

ATG GCT GGC GGC ATC ATA GTC AGG G 25 

SEQ ID NO: 4 

AAC CGA ATC TTC AGG TCT TCC TGG GG 26 

SEQ ID NO: 5 

TCG CTC TCT CCT GGG GAC ACA GA 23 

SEQ ID NO: 6 

CCA GGT GCT ATT CCA TGT ATG TCA TAG 27 

SEQ ID NO: 7 

TCT GTG TCC CCA GGA GAG AGC GA 23 

SEQ ID NO: 8 

ATA GTG AAG CTG TCT TCT CAA T 22 
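
The correspondence between SEQ ID NO: 2 and SEQ ID NO: 1 can be checked by translating the nucleotide sequence. The following is a minimal Python sketch and is not part of the original disclosure; it assumes the standard genetic code, and the names CODON_TABLE, SEQ_ID_NO_2 and translate are illustrative.

```python
# Standard genetic code (an assumption of this sketch), with codons
# enumerated in the conventional order of bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# SEQ ID NO: 2 copied from the Sequence Listing; whitespace is stripped below.
SEQ_ID_NO_2 = "".join("""
ATGA AGAAGCTGAT
GGTGGTGCTG AGTCTGATTG CTGCAGCCTG GGCAGAGGAG CAGAATAAGT TGGTGCATGG
CGGACCCTGC GACAAGACAT CTCACCCCTA CCAAGCTGCC CTCTACACCT CGGGCCACTT
GCTCTGTGGT GGGGTCCTTA TCCATCCACT GTGGGTCCTC ACAGCTGCCC ACTGCAAAAA
ACCGAATCTT CAGGTCTTCC TGGGGAAGCA TAACCTTCGG CAAAGGGAGA GTTCCCAGGA
GCAGAGTTCT GTTGTCCGGG CTGTGATCCA CCCTGACTAT GATGCCGCCA GCCATGACCA
GGACATCATG CTGTTGCGCC TGGCACGCCC AGCCAAACTC TCTGAACTCA TCCAGCCCCT
TCCCCTGGAG AGGGACTGCT CAGCCAACAC CACCAGCTGC CACATCCTGG GCTGGGGCAA
GACAGCAGAT GGTGATTTCC CTGACACCAT CCAGTGTGCA TACATCCACC TGGTGTCCCG
TGAGGAGTGT GAGCATGCCT ACCCTGGCCA GATCACCCAG AACATGTTGT GTGCTGGGGA
TGAGAAGTAC GGGAAGGATT CCTGCCAGGG TGATTCTGGG GGTCCGCTGG TATGTGGAGA
CCACCTCCGA GGCCTTGTGT CATGGGGTAA CATCCCCTGT GGATCAAAGG AGAAGCCAGG
AGTCTACACC AACGTCTGCA GATACACGAA CTGGATCCAA AAAACCATTC AGGCCAAG
""".split())


def translate(dna: str) -> str:
    """Translate a coding sequence, read in frame from its first base."""
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


protein = translate(SEQ_ID_NO_2)
print(len(SEQ_ID_NO_2), len(protein))  # nucleotide and residue counts
print(protein[:16], protein[-4:])      # first and last residues, one-letter code
```

Translating the 732 nucleotides as printed gives 244 residues beginning Met Lys Lys Leu and ending Ile Gln Ala Lys, with no stop codon, in agreement with SEQ ID NO: 1.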



Claims 

1. An amino acid compound which comprises the amino acid sequence 



Met Lys Lys Leu Met Val Val Leu Ser Leu Ile Ala Ala Ala Trp Ala 
                5                   10                  15

Glu Glu Gln Asn Lys Leu Val His Gly Gly Pro Cys Asp Lys Thr Ser 
            20                  25                  30

His Pro Tyr Gln Ala Ala Leu Tyr Thr Ser Gly His Leu Leu Cys Gly 
        35                  40                  45

Gly Val Leu Ile His Pro Leu Trp Val Leu Thr Ala Ala His Cys Lys 
    50                  55                  60

Lys Pro Asn Leu Gln Val Phe Leu Gly Lys His Asn Leu Arg Gln Arg 
65                  70                  75                  80

Glu Ser Ser Gln Glu Gln Ser Ser Val Val Arg Ala Val Ile His Pro 
                85                  90                  95

Asp Tyr Asp Ala Ala Ser His Asp Gln Asp Ile Met Leu Leu Arg Leu 
            100                 105                 110

Ala Arg Pro Ala Lys Leu Ser Glu Leu Ile Gln Pro Leu Pro Leu Glu 
        115                 120                 125

Arg Asp Cys Ser Ala Asn Thr Thr Ser Cys His Ile Leu Gly Trp Gly 
    130                 135                 140

Lys Thr Ala Asp Gly Asp Phe Pro Asp Thr Ile Gln Cys Ala Tyr Ile 
145                 150                 155                 160

His Leu Val Ser Arg Glu Glu Cys Glu His Ala Tyr Pro Gly Gln Ile 
                165                 170                 175

Thr Gln Asn Met Leu Cys Ala Gly Asp Glu Lys Tyr Gly Lys Asp Ser 
            180                 185                 190

Cys Gln Gly Asp Ser Gly Gly Pro Leu Val Cys Gly Asp His Leu Arg 
        195                 200                 205

Gly Leu Val Ser Trp Gly Asn Ile Pro Cys Gly Ser Lys Glu Lys Pro 
    210                 215                 220

Gly Val Tyr Thr Asn Val Cys Arg Tyr Thr Asn Trp Ile Gln Lys Thr 
225                 230                 235                 240

Ile Gln Ala Lys 
            244

hereinafter defined as SEQ ID NO:1, or a functional equivalent thereof. 

2. A nucleic acid compound which comprises a nucleic acid sequence which encodes for a compound of 
Claim 1 or a part thereof. 

3. A nucleic acid compound as claimed in Claim 2 which comprises the sequence 

ATGAAGAAGC TGATGGTGGT GCTGAGTCTG ATTGCTGCAG CCTGGGCAGA 
GGAGCAGAAT AAGTTGGTGC ATGGCGGACC CTGCGACAAG ACATCTCACC 
CCTACCAAGC TGCCCTCTAC ACCTCGGGCC ACTTGCTCTG TGGTGGGGTC 
CTTATCCATC CACTGTGGGT CCTCACAGCT GCCCACTGCA AAAAACCGAA 
TCTTCAGGTC TTCCTGGGGA AGCATAACCT TCGGCAAAGG GAGAGTTCCC 
AGGAGCAGAG TTCTGTTGTC CGGGCTGTGA TCCACCCTGA CTATGATGCC 
GCCAGCCATG ACCAGGACAT CATGCTGTTG CGCCTGGCAC GCCCAGCCAA 
ACTCTCTGAA CTCATCCAGC CCCTTCCCCT GGAGAGGGAC TGCTCAGCCA 
ACACCACCAG CTGCCACATC CTGGGCTGGG GCAAGACAGC AGATGGTGAT 
TTCCCTGACA CCATCCAGTG TGCATACATC CACCTGGTGT CCCGTGAGGA 
GTGTGAGCAT GCCTACCCTG GCCAGATCAC CCAGAACATG TTGTGTGCTG 
GGGATGAGAA GTACGGGAAG GATTCCTGCC AGGGTGATTC TGGGGGTCCG 
CTGGTATGTG GAGACCACCT CCGAGGCCTT GTGTCATGGG GTAACATCCC 
CTGTGGATCA AAGGAGAAGC CAGGAGTCTA CACCAACGTC TGCAGATACA 
CGAACTGGAT CCAAAAAACC ATTCAGGCCA AG 

or a functional equivalent or part thereof. 

4. A nucleic acid vector which comprises the nucleic acid compound of Claim 3. 

5. A DNA vector of Claim 4 which is pSZyme. 

6. A host cell transfected with a nucleic acid vector of Claim 4. 

7. A genomic clone of Zyme which comprises a 4.0 kilobase HindIII fragment from a human chromosome 19 
library which hybridizes to fragments of DNA of the compound of Claim 3 under conditions suitable for 
selective hybridization. 

8. A process for diagnosing Alzheimer's disease or a propensity to develop Alzheimer's disease in a patient 
which comprises 

a) securing DNA from said patient; 

b) digesting said DNA with a restriction enzyme; 

c) hybridizing said digested DNA with a labeled nucleotide sequence corresponding to the compound 
of SEQ ID NO:2, or a part thereof; and 

d) comparing pattern of hybridization to similarly-digested and hybridized band configurations of those 
members of the donor's family who display or have displayed the symptoms of Alzheimer's disease. 

9. An assay for determining whether a test substance is a functional ligand for a protein of SEQ ID NO:1, 
said method comprising 

a) contacting the protein with said test substance; 

b) monitoring the protein's activity by physically detectable means; and 

c) identifying those substances which interact with or affect the activity of the protein relative to a con- 
trol which receives no test substance. 

10. A method for expressing a nucleic acid sequence as claimed in Claim 2 in a transfected host cell, said 
method comprising culturing said transfected host cell under conditions suitable for gene expression. 